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CRAPTKH  I 


I f  TKODl' CiJ-  X ON  TO  THK  PROBLEM 
Life  cycle  analyses  and  tho  determination  of  system 
parameters,  such  a a  moan  time  to  repair  and  moan  time  be¬ 
tween  failures,  are  dependent  on  obtaining  sufficiently 
accurate  mid  tiiaoly  data  from  the  organizational  or  user 
unit.  The  Army  had  boon  collecting  this  data  under  TAKRS 
(Tho  Army  Engineering  Report  System)  and  is  presently  col¬ 
lecting  it  under  TAMMS  (Tine  Army  Materiel  Management  Sys¬ 
tem).  The  hope  of  these  systems  was  to  collect  all  infor¬ 
mation  on  failure  times,  repair  times,  and  corrective  actions 
on  nearly  evory  item  of  equipment  in  tho  Army  inventory. 

Tho  result  was  a  huge  quantity  of  mostly  useless  information 
that  required  much  paper  work  on  the  part  of  Army  maintenance 
men  and  data  collectors.  Computer  storage  of  nl^  the  data 
developed  into  a  monumental  task.  The  most  unfortunate 
aspect  of  this  system,  however,  is  tho  inability  to  get 
specific  management  information.  In  most  instances,  even 
if  tho  data  elements  such  ns  localization  timo  wero  rocordod, 
tho  data  is  subject  to  inaccuracy  and  omitions  sinco  in  many 
personnel  uefrainou  in  unta  collection  uXo  providing 
the  information. 

To  conserve  time  and  resources,  tlio  Army  has  now  decid¬ 
ed  to  discontinue  TAMMS  on  all  but  a  few  major  itoms  and  has 
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implemented  the  concept  of  lSpocIqI  Operational  Data  Collec¬ 
tion.  According  tc  AHC  Regulation  No,  75>0-l|3»  Paragraph  3B, 
Special  Operational  Cntu  Collection  is  defined  as; 

A  data  collection,  processing,  and  analysis  systom 
controlled  by  thn  equipment  proponent  and  specif¬ 
ically  designed  to  gatlior  aoevrato,  timely,  and  com¬ 
plete  data  necessary  to  ovaluato  the  performance  ef¬ 
fectiveness  of  Army  materiel  in  the  field. 

It  is  the  hopo  of  this  paper  to  present  an  organized  and 
general  approach  to  providing  reliability  and  maintainabil¬ 
ity  parameters  in  such  a  system.  Bocauso  of  the  great  size 
of  the  aroa  of  investigation,  only  a  gonorai  framework  and 
a  few  spoeific  examples  will  bo  presented  in  this  paper. 

The  problem  of  developing  a  general  sampling  plan  for 
this  system  is  complicated  by  the  amount  of  non-statistical 
externa.!  constraints.  The  basic  approach  will  bo  to  detail 
tho  decisions  necessary  to  arrive  at  a  sampling  technique 
and  to  fit  this  technique  into  tho  constraints  of  tho  specif¬ 
ic  problem.  These  decisions  nro  detailed  in  chapter  tiiroe. 

As  will  be  shown  in  chap tor  two,  most  literature  on 
sampling  techniques  pertains  to  acoeptancQ  sampling  whore  a 
forehand  knowledge  of  acceptable  and  unaoceptRbi e  character¬ 
istics  allows  fer  tho  development  of  soundly  founded  "optimal" 
accept one o  s sjup ling  p— < l*  e  mv/; y^u^od  in  tox ms 

of  .x  and  fi  risks  which  define  the  probability  of  rejection  ox 
items  with  acceptable  characteristics  (  <x  risk)  or  acceptance 
of  items  with  undosireable  characteristics  (  yd  risk). 
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The  baste  diiT^ronco  in  designing  sampling  plans  for 
maintenance  information  rotriovai  is  that  there  are  no  ac¬ 
ceptable  or  unaco  op  table  chnrao  tr  oistJ  os  on  which  to  design 
a  tost.  This  doos  not  mean  the  statistical  techniques  used 
in  dotormining  sample  sizo  ai'o  different  frrr.  nccoptmon 
sampling  boclmiquos.  It  .just  moans  that  one  must  havo  a 
diffoi’ont  terminology  and  a  different  approach.  One  must 
now  determine  confidence  intervals  for  the  pnz’amotors  of 
infcorest.  It  is  unfortunate  that  those  confidence  inter¬ 
vals  depend  on  both  anmplo  size  and  variance.  It  will  bo 
shown  that  oven  when  tho  mean,  variance  and  distribution  are 
assumed  known,  the  task  of  providing  the  most  cost-effective 
sampling  plan  is  prodigious.  Tho  information  which  is  re¬ 
quired  by  tho  Army  when  requesting  a  special  data  collection 
plan  is  as  follows:^ 

1.  Security  guido. 

2.  Project  description. 

3.  Basic  characteristics  to  bo  measured. 

4.  A  matoriol  doscripti.cn, 

5.  Proposed  start  and  completion  dates. 

6.  Typo  of  units  oxpoctod  to  participate, 

7*  Geographical  locations. 

0,  Resources  estimate o. 


1  AHC  KT,XrULAT  I  ON  NO.  7^0-43#  22  April  7C 


CHAiTiiK  II 


LjT.'jrATU  ,E  liKARCil 

Thu  goal  of  thu  .litoraturo  uearch  will  bo  to  do  tor. nine 
tho  ba.iio  Koliabilifcy  and  I'miixtuLuability  parvunotors  to  bo 
nciiauroo,  determine  and  dofino  tho  duln  elements  to  bo  col- 
looted,  and  provide  methods  to  doton.uno  sampling  plans. 
Also,  tho  problems  of  oxtcrnal  factors  such  as  geographical 
location,  sampling  costs,  and  other  oxtornal  constraints 
will  to  con3idox’od  in  tho  Maintainability  and  Reliability 
testing  program. 

Reii ability  is  defined  in  AUINC  (1)'  ns: 

Reliability  is  the  probability  that  a  system 
will  poi’form  satisfactorily  for  at  leant  n  given 
period  of  time  wbon  used  uxidor  statod  condi¬ 
tions. 

In  order  to  apply  this  definition  of  Reliability,  one 
must  dofino  what  a  fu.i3.uro  is  and  tho  conditions  xuidor  which 
the  it on  is  to  bo  used  (mission,  geographical  icontion, 
typos  of  personnol). 

Tho  reliability  function  is  tho  sumo  probability  ox- 
prossod  as  a  function  of  timo.  It  is  one  of  tho  pai’oxnotcrs 
which  may  bo  used  to  dofino  reliability  in  a  quantitative 
fashion.  It  is  usually  measured  by  a  survival  cure. 


1  Tho  numboi'  in  tho  parenthesis  is  the  location  of  the 
text  in  tho  reference  list. 
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However,  because  of  the  increasing  importance  or  the 
ay  atom  oharnctoristio  known  as  availability,  a  moro  popular 
measure  of  reliability  is  Moan  Timo  Botwoon  Failures  (MTBF). 
Tt  has  boon  pointed  out  in  AHINC  (1),  however,  that  when  one 
does  specify  this  parameter  ono  also  must  specify  the  dis¬ 
tribution  of  time  between  failures  to  go  with  it*  MTBF  by 
itself  is  not  a  moasuro  of  reliability.  It  is  this  param¬ 
eter  MTBF,  along  with  tho  maintainability  parameters,  for 
which  tests  will  bo  primarily  designed. 

Maintainability  i3  dofinod  in  MTf.-MTD  77®  ns  follows: 

Maintainability  i3  a  charnctoristi c  of 
design  and  installation  which  is  expressed 
as  tho  probability  that  nn  It on  will  conform 
to  specif iod  conditions  within  a  givon  period 
of  time  when  maintenance  action  is  performed 
in  accordance  with  prescribed  procedures  and 
rosourcos, 

Tho  poriod  of  time  required  to  rostoro  an  item  to  a 
spocifiod  condition  is  callod  down  time.  Down  timo  has  been 
broken  into  many  parts.  The  two  major  parts  being  active 
down  timo  and  delay  down  time.  In  active  down  time,  one 
usually  makes  a  distinction  botwoon  preventive  (schodulod^ 
down  timo  and  corrective  down  timo.  Corrective  down  time 
may  also  bo  brokon  down  into  localization,  isolation,  repair- 
replace,  and  chock  out  time.  It  is  therefore  very  important 
to  dotormino  which,  if  not  all,  of  those  times  are  important 
to  tho  probloro  at  hand  and  design  tho  data  oolloction  system 


to  fit  those  neods 
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Tho  luuintn  inabi  lity  parameters  usually  monsured  tiro 
He  an  Tine  to  Repair  ( M'l'TH ) ,  Moan  Preventive  Action  Time 
Moon  Active  Corrective  and  Preventive*  Action  Time 
(M),  rukl  of  courso  Mean  Down  TJ.no  (MDT).  It  must  tiiao  be 
lcopt  in  mind  that  those  parameters  do  not  havo  a  full  moan¬ 
ing  uv»t j  1  tho  distribution  of  times  in  specified. 

MTTR,  Mpt>  and  H  aro  usually  obaoi  vod  to  bo  log  normal¬ 
ly  distributed.  Therefore  by  taking  tin  logarithm  of  each 
repair  t  imo,  tho  resulting  distribution  la  usually  normal, 
Caro  must  bo  talcon  when  using  this  technique  since  tho  anti- 
log  of  tho  moan  of  the  normal  is  not  tho  moan  of  tho  log 
normal,  but  the  median  of  the  log  normul.  Delay  down  time 
docs  not  poososs  this  behavior,  and  it  is  thrr-jforo  a 

good  pi’actico  to  separate  at  least  the  delay  times  iroxn 
active  repair  times.  Delay  time  is  usually  further  subdivid¬ 
ed  into  logistics  time  and  administrative  time.  The  c omp ar— 
ison  of  Moan  Dolay  Timo  to  MTTR  is  also  very  valuable  to  the 
maintainability  engineer  in  determining  whether  system  (item) 
design  is  inadequate  or  whether  repair  facility  lay  out  or 
sparing  levels  nro  inadequate. 

The  majority  of  tho  statistics  necessary  for  the  detor- 
minaticn  of  a  sampling  plan  and  tho  analysis  °f  the  data  is 
presented  in  Quality  Control  and  Industrial  Statistics,  by 
Duncan  (3),  Most  sampling  plans  prosontod  in  this  text  ore 
acceptance  sampling  plans  or  test  of  hypotaosis.  Those  plans 
are  concerned  with  whether  a  sample  statistic  falls  within 


an  interval  or  outside  an  interval.  Of  intorost  to  this 
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paper  are  the  methods  of  determining  this  Interval*  hopefully 
prior  to  the  analysis  of  data. 

The  following  chapter  presents  the  logic  necessary  to 
arrive  at  a  sampling  technique. 


DECISIONS  NEEDED  TO  ARRIVE  AT  A  SAMPLING  PLAN 
Tho  following  suctions  hopg  propnrod  to  itomize  tho 


decisions  and  considerations  necessary  to  formulate  a  sam¬ 
pling  plan.  Only  two  ouch  piano  have  boon  considered  in 
this  paper.  Other  piano,  such  no  those  to  measure  maxi  - 
num  time  to  repair  or  combinations  of  tho  various  param¬ 
eters,  have  boon  excluded  since  tho  purposo  of  this  paour 
i3  to  present  only  a  general  framework  and  a  iVv;  spocific 
examples.  The  tvro  parameters  MTBF  and  NTTR  were  epos  on 
because  of  their  importance  in  do to mining  tho  availability 
of  a  pioco  of  equipment. 

Two  following  discussion  should  be  applicable  whothor 
one  is  looking  at  an  onliro  system,  a  subsystem,  or  just 
one  olomont  of  n  system.  It  should  bo  noted  that  tho  par¬ 
ticulars  of  the  plun,  sxich  no  tho  data  colloction  forms  and 
the  typo  of  data  collectors,  will  bo  dopondont  on  tho  item 
in  question,  but  tho  method  of  determining  sample  size  and 
sampli  \g  time,  which  are  presented  in  tho  following  chapters, 
should  not  bo  dependent  on  tneso  particulars* 

Tho  following  section  will  discuss  the  mean  time  be¬ 
tween  failuros  (MTBF)  parameter. 


Mo.ji  Time  Between  Failures 
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In  most  cases,  whore  a  roprosontativo  and  sufficiently 
largo  sample  exists,  the  distribution  of  the  sampled  mean 

time  botwoon  failures  should  be  approximately  a  normal  dis¬ 
tribution  about  the  true  moan  time  botwoen  failures.  It  is 
for  this  reason  that  MTBF  and  MTTR  are  particularly  nice 
parameters  to  moasuro.  If  one  knows  tho  variance  of  time 
between  failures,  one  may  always  determine  a  s, ample  3ize  N 
required  to  measure  MTBF  to  tho  dosirod  accuracy,  as  will 
be  shown  in  subsequent  chapter.  It  is  the  form  of  the  dis¬ 


tribution,  however,  which  will  determine  the  amount  of  time 
nocossary  to  accomplish  tho  N  failures  and  wi3.1  determine 
how  tho  data  is  analysed.  The  exponential,  time  between 
failures  will  be  discussed  in  Chapter  IV  and  tho  normal 
tiir.  j  between  failures  will  be  discussed  in  chapter  V. 

Figure  1  presents  tho  decisions  necossory  to  arrive  at 
a  dosirod  sampling  plan.  The  first  consideration  in  devel¬ 
oping  a  sampling  plan  concerns  the  suspension  of  scheduled 
maintenance.  If  tho  item  under  consideration  is  replaced 
or  parts  of  tho  item  are  replaced  or  repaired  in  tho  process 
of  scheduled  maintenance,  and  these  actions  will  interfere 
with  the  acquisition  of  the  required  data,  then  the  suspen¬ 
sion  of  scheduled  maintenance  may  be  required. 


If  the  item  to  be  sampled  is  a  new  system,  the  time 
between  failures  may  not  bo  charaot oris tic  of  that  item  in 
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later  operation.  it  is  for  this  reason  that  subsequent  cam¬ 
ples  nay  bo  doomed  noconsary. 

If  no  good  oatlxuato  of  t.hn  moan  is  available,  the  dura¬ 
tion  oi  the  toot  is  indoterminablo.  The  occurrence  of  this 
situation  is  not  usually  very  common  since  the  procurement 
and  design  phases  of  life  ‘vela  usually  provide  this  informa¬ 
tion,  If  this  case  does  arriso,  however,  one  may  choose  to 
tost  a  small,  handy  sample  for  the  moan  and  uso  this  as  an 
estimate  to  determine  this  parameter  with  a  higher  confidence 
in  a  subsequent  tost.  If  a  plan  \^ith  an  indefinite  termina¬ 
tion  date  is  acceptable,  tho  need  for  a  special  moan  tost  is 
not  necessary. 

The  confidence  with  which  tho  mean  is  measured  will  de¬ 
pend  on  tho  sample  variance  and  tho  size  of  tho  sample.  If 
a  target-  confidence  on  I-ITBP  oxists,  the  sample  size  will  be 
dotorminod  f i  orn  consideration  of  tho  variance.  If  no  esti- 

oV  "Li-'. o  variance  ic  .n v.ii  X .n V>  1  ^ }  "tlion  on o  f^nv* 

the  Variance.  If  the  sample  size  chosen  for  this  tost  was 
3uch  that  a  "t"  tost  providod  sufficient  confidence  on  the 
moan,  one  need  not  porfonu  a  socond  tost.  However,  this  is 
unlikely . 

i’inally,  ono  must  consider  tho  form  of  tho  distribution. 
If  the  form  con  be  assumod  exponential  or  normal,  then  the 
methods  of  Chapters  IV  and  V  can  be  applied.  If  the  distri¬ 
bution  is  known  to  bo  of  some  other  form,  a  plan  may  bo 
formed  using  the  considerations  of  Chapters  IV  and  V.  If  the 
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Figure  1.  DECISIONS  TO  DEVELOFE  A  HTBF  SAMPLING  PLAN 
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Mean  Timo  To  Repair 
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The  distribution  of  the  sampled  moan  time  to  repair  will 
also  approaoh  a  normal  distribution  t or  a  sufficiently  repre¬ 
sentative  and  large  sample.  The  sample  size  N  needed  to  ob¬ 
tain  the  desired  accuracy  is  easily  calculated  once  the  vari¬ 
ance  of  time  to  repair  is  known.  If  ono  wishes  to  induce 
failures,  no  problem  in  obtaining  this  required  number  is 
expected.  If  one  is  using  naturally  occurring  failures, 
however,  one  needs  an  estimate  of  the  failure  rate  to  deter¬ 
mine  the  time  necessary  to  observe  the  required  number  of 
failures. 

Figure  2  presents  the  decisions  necessary  to  arrive  at 
a  desired  sampling  plan.  The  first  decision  necessary  con¬ 
cerns  wheth  ir  the  item  is  newly  fielded  or  not.  Special 
considoration  must  be  given  to  a  newly  fielded  item  sinco 
the  repair  times  may  not  be  representative  of  those  times 

4-  „  T  „  +  -S  4-V.O  1  A  /-.e  tVio  Horn  fh-in  rmiflt  consider 

C^U  o  J.QWUA  W  X  **  V  X*  •  W»*V  V  V*  V  tAW  #•  v  v— •  ~ ■  - -  - 

performing  a  subsequent  test  to  determine  the  changes  in 
time  to  repair. 

Next,  we  mu3t  consider  the  form  of  the  distribution  of 
time  to  repair.  The  case  of  log  normal  time  to  repair  is 
covered  in  Chapter  VI.  Other  distributions  would  have  to 
bo  considered  on  their  own  merit. 

A  prior  estimate  of  the  mean  time  to  repair  is  not 
usually  necessary  in  determining  the  test  duration  if  the 
collection  of  the  data  uses  naturally  occurring  failures. 


In  tills  c (iso »  tho  time  constraint  is  usually  more  dnpondcnt 
on  tho  amount  oi'  timo  that  is  required  to  obtain  tho  necessary 
failures  than  it  is  on  tho  tins  required  to  perform  the  re¬ 
pairs,  If  tho  repair  timo  io  a  significant  factor  in  deter- 
mining  tho  tost  duration,  it  may  bo  nocoasary  to  sample  o 
small  convenient  population  to  got  an  estimate. 

Tho  sample  si 7.0  will  bo  dotonnined  by  the  desired  accur¬ 
acy  and  the  variance  of  tho  timo  to  repair.  If  no  estimate 
of  this  variance  is  available,  a  convenient  sample  should  be 
tested  for  the  variance.  If  the  mean  is  determined  to  tho 
desirod  accuracy  by  moans  of  a  "t"  tost,  further  sampling 
is  unnecessary. 

If  tho  MTTR  i3  to  bo  determined  within  a3  short  a  time 
as  possible,  artificially  induced  failures  should  bo  consid¬ 
ered,  There  are  sovoral  problomo  with  this  mothod.  The  first 
i3  determining  a  representative  set  of  failures  to  be  induced, 
since  this  will  effect  the  MTTR.  One  must  also  consider  tho 
facilities  nocos3ary  to  handlo  the  work  load. 

If  artificially  induced  failures  are  not  used,  an  esti¬ 
mate  of  tho  l'ailuro  rato  or  time  botwoon  failures  is  necessary 
to  determine  the  duration  of  n  tost.  A  convenient  sample 
may  be  tested  to  dotormino  this  paramotor  if  it  is  not  known. 
It  may  oven  be  nocoaaary  to  know  the  variance  of  time  between 
failuro3  if  tho  sample  size  is  not  sufficiently  large. 

The  next  chaptor  pro3ont3  the  considerations  necessary  to 
detorminc  a  sampling  plan  for  moan  timo  between  failures  when 


the  distribution  of  time  between  failures  is  normal 


CHAPTER  IV 


EXPONENTIALLY  DISTRIBUTED  TIME  BETWEHW  FAILURES 
Tbo  following  method  is  useful  in  determining  sample 
size  and  sampling  time  necessary  to  attain  desired  accuracy, 


when  the  distribution  is  assumed  to  be  exponential  and  one 
has  an  estimate  of  mean  time  betwoon  failures. 

One  can  assume  that  the  distribution  of  the  moan  will 
be  normal  for  a  sufficiently  large  sjouple  (N>15)  and  that 
the  variance  of  the  mean  will  be  equal  to  the  variance  of 
the  times  between  failure  divided  by  the  sample  sizo. 

Twn  r\  r  4*  V*  a  o  vr»  av*>  avi  4-4  4  4  Vw  \  4-  4  av\  4  n 

*-*v  *  v « «ik  va  ujav  wnv*»  ex  uxgui  au 


f(t)  =  (i/e)  o-t/e 


with  mean  =  0 

2 

and  variance  =  6  . 

Therefore  the  variance  of  the  normally  distrioutod  MTBF  will 

Q  _ 

be  0  /N  and  the  standard  deviation  will  be  0"  =  0/  -/N  , 

If  one  wishes  a  95%  confidence  interval  on  MTBF  the  interval 
will  bo  (from  normal  tables  Z  =  1.96) 

G  i  1.96  cr 


or 

9  (1  ~  1  .96/  A/tf  ) 

It  is  because  the  exponential  distribution  is  a  ono  parameter 
distribution  that  ore  gets  the  dependence  rf  varianoe  on  the 
moan. 
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Tlu.3  dependence  allows  un  to  determine 
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3 ample  size  in¬ 


dependently  of  tlio  moan.  It  is  accomplished  by  dividing  tho 
oonfidonoo  interval  by  tho  moim  giving  a  percent  confidence 
intorval.  For  example,  il'  ono  wishos  t.ho  truo  MTBF  to  lio 
within  1  0?»  of  tho  o a t i ma to,  with  confidence,  ono  calcu¬ 
lates  tho  sample  sizo  by  tho  following  mothod: 

Z>97pO/0  .10 

i.96  (e/VN  )/o  =  .10 

1.96  /\rli  -  .1 

VN  =  19.6 
N  =  3O4 

It  waa  by  this  nothod  that  Tablo  1  was  calculated  by 
U3ATE00M  ^  This  number  N  represents  tho  numb  or  of  fail- 
ui’os  tin  t  must  bo  obooi’vod  to  got  tho  desirod  accuracy. 

Tho  problem  now  is  to  dotormino  a  sumplo  si  so  and  a  sampling 
timo. 


Tho  expocted  number  of  failures  from  a  oninplo  of  3ize  n 
with  moan  0  in  a  period  of  timo  T  is 
N(T)  =  n(1  -  o-T/G) 

This  formula  assumes  sampling  without  roplncomnnt.  This 
formula  will  bo  accoptnblo  if  tho  probability  of  more  than 
ono  failure  is  small  (largo  n,  small  T).  If  moro  than  ono 
failure  por  item  ia  allowed,  a  different  approach  i3  acces¬ 
sary.  This  approach  will  be  covered  lator.  Tho  valuo  K(T) 
has  already  boon  determined  from  Table  1  and  a  curve  of  T/0 


TABLE  1 


20 


Failuro  Observation  Requirements  to 
Limit  Errora  in  MTDF  Eotima.too 


PERCENTAGE  ERROR 

IN  KTBF 

REQUIRED  FAILURE 

OBSERVATIONS 

PROBABILIT 

.85 

C  THAT  ERR( 
.90 

,)R  IS  NOT  I 

.95 

3XCEDED 

0  99 

5 

83O 

1082 

1537 

2655 

10 

207 

271 

384 

664 

15 

92 

120 

171 

295 

25 

33 

43 

61 

106 

35 

_ , 

17 

22 

31 

54 

Koto.  Assuming  an  exponential  distribution  of  times  botwoon 
failuros. 

Roferonoo:  Guidebook  for  Systems  Analysi3/Cost  Eff octiveno33 
USATECOM,  March  1969.1 


Inquirios  regarding  this  document  may  bo  forwarded  to  the 
Commanding  General,  USATECOM,  ATI  .I:  AMXRD-AMO. 
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•  The  functional  relationship  is 
T/e  a  -In  (1  -  N/n) 

If  tho  ooctt  naooolatod  with  time  in  0^  (T)  and  the  cost  as¬ 
sociated  with  sample  size  is  G_,(n),  tho  total  cost  will  bo 

(1. 

C  =  C.|  (T)  +  C-^n).  To  determine  a  particular  value  of  T 

and  n  one  must  minimise  tho  total  cost,  subject  to  the  con¬ 
i'! 

atraint  T  +  Oln  (1  -  ~  )  =  C*  To  accomplish  this,  roplaco 
T  by  -  (1/0)  In  (1-N/n)  in  0^  (T)  thorofero  producing  a  coat 
which  is  dependent  on  only  0110  variable  n. 

C(n)  =  C  (-  ( 1  /  0 )  In  ( 1  -N/n )  )  +  (n) 

I  1. 

To  dotornino  tho  maxima  or  minima  ^nluoa,  sot  the  derivative 
of  thi3  function  equal  to  r,oro.  A  negative  second  deriva¬ 
tive  implies  a  local  minimum.  If  there  is  more  than  one 
local  minimum,  chooso  tho  one  which  yields  the  lowest  total 
coot,  Caro  must  bo  taken,  however,  since  n  must  bo  an  integer. 

Tho  above  calculations  were  made  using  tho  expected 
number  of  failures.  It  is  therefore  reasonable  to  assume 
that  about  one  half  tho  time  this  method  is  used,  one  is 
not  going  to  receive  the  number  of  observations  necessary 
to  obtain  the  accuracy  for  which  one  has  plannod,  This 
occurs  bccauoo  the  distribution  of  tho  rumbor  of  evoxits  in 
a  time  interval  is  a  Poisson  distribution  and  for  most  cases 
can  be  approximated  by  a  normal.  If  having  tho  specified 
accuracy  13  very  important,  ono  would  like  to  design  tho  test 
in  a  way  that  one  would  have  a  high  confidence  of  meeting 
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tho  accuracy. 

The  distribution  of  events  in  an  interval,  when  tine 
between  events  is  exponential,  is  a  Poisson  distribution. 

The  meun  and  variance  of  this  distribution  is  N.  If  one 
approximates  tho  Poisson  by  a  normal,  which  is  a  good  approx¬ 
imation  for  large  N,  one  can  compute  an  effective  N.( 

131  =  N  +  Zy  N 

Whore  Z^>  is  tho  N(0,1)  coordinate  which  gives  the  desirod 
assurance^  .  One  may  now  uae  in  the  place  of  N  in  the 
previous  calculations.  The  cost  has  obviously  increased  to 
get  assurance  of  accuracy. 

The  question  is  now,  "How  good  was  the  initial  estimate 
of  KTBF?".  Obviously,  the  confidence  interval  had  to  be 
greater  than  what  was  desired  or  vher  would  bo  no  need  for 
further  sampling.  If  the  accuracy  desired  is  neoded  with  a 
high  probability  then  knowing  the  variance  of  the  estimate 
one  may  use  an  effective  mean  0O  such  that 

e0  ---  e  (i  +  Zg/  VhQ  ). 

Z^-is  the  normal  coordinate  for  the  assurance  desired  and 
is  tho  samplo  size  used  to  mako  the  first  estimate  of  the 
mean  0.  Tills  modification  will  also  cost  money  since  it 
will  increase  the  sampling  time. 

If  ones  allows  the  items  in  tho  sample  to  have  more 
than  one  failure,  a  decrease  in  sample'size  or  a  decrease  in 
the  sampling  time  nocessary  to  obtain  the  same  accuracy  can 
be  expected.  This  ooours  because  the  number  of  failures  to 


bo  observed  remains  constant  while  the  number  of  failures 
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per  item  increases.  Tlio  ©stimato  for  KTBF  in  this  Case  is 
0  -  (  nTr  -  Tr>  )  /  K 

Where  Tp  is  tho  duration  of  the  tost,  is  the  total  down 

time  of  cil.  1  failed  items,  n  is  tho  number  of  items  on  tost, 

and  N  is  the  number  of  failures.  Tho  quantity  2UG/G  i3  a 

Chi  Square  variable  with  2N  degrees  of  froodon^*'  .  For 

£-U 

Values  of  2K>30  ono  can  assume  that  tho  quantity 

ik'm  /  6  -  \/ijM  -  1 

has  a  normal  distribution  with  a  mean  0  and  a  variance  1. 

If  one  wishes  6  to  bo  \fithin  1  0?S  of  G  with  a  95>'£  confi¬ 
dence,  the  above  quantity  must  lie  between  -1.9b  and  +  1.96. 
If  G  -  0.9S  then  tho  normal  coordinate  is  -1.96  Mid 
ifk  N  (.9  0)  /  e  -  !fk  N  -  1  =  -1.96 
If  ono  nogloct3  the  -1  compared  to  largo  li,  one  calculates 
11  -  384*  Ono  will  note  that  this  is  tho  samo  solution  as 
before,  and  therefore  ono  is  allowed  to  usa  Table  1  to  de¬ 
termine  N.  Tho  difference  lies  in  the  fact  that  wo  now  have 
a  different  relationship  between  test  duration  and  sample 
size. 

T  =  (  K0  +  Td)  /  n 

A  sampling  plan  can  be  determined  in  much  the  same  manner 
as  for  the  ono  failure  per  item  approach,  tho  only  differ¬ 
ence  being  the  duration- sample  size  relationship.  One 
drawback,  howe/or,  is  tho  need  for  an  estimate  of  total 
down  time  of  all  failed  items.  An  estimate  of  this  par  am- 


etor  is 
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Td  =  NttMDT 

whore  MDT  in  tho  neon  down  time. 

The  final  approach  to  be  considered  is  to  terminate 
tho  sampling  plan  after  a  specified  number  of  failures.  This 
plan  has  tho  great  advantage  of  simplicity.  One  need  only 
look  in  Table  1  for  the  cut-off  number  for  tho  desired 
accuracy  and  choose  a  sample  size  (usually  about  the  size 
of  tho  cut-off  number).  The  big  disadvantage,  however,  is 
the  unpredictability  of  the  plan's  duration.  One  cannot 
schedule  work  loads  and  assignments  with  much  confidence. 

If  one  bus  no  prior  estimates  of  6,  however,  it  is 
probably  the  easiest  method  to  obtain  this  estimate.  Using 
a  rather  broad  confidence  interval  and,  therefore,  a  com- 
paritively  small  N  and  scmple  size,  one  can  sample  items 
conveniently  at  hand  and  in  a  hopefully  3hort  time  have 
the  estimate.  If  desired,  a  specified  termination  date 
override  may  be  used  to  prevent  extremely  good  items  from 
consuming  too  much  time.  The  full  sampling  problem  can 
then  be  approached  in  the  more  complete  manner. 

Tho  noxt  charter  will  present  methods  to  determine 
mean  time  between  failures  when  the  distribution  of  time 
between  failures  is  normal. 


CHAPTER  V 


NORMALLY  DISTRIBUTED  TIME  BETWEEN  FAILURES 

If  one  has  an  ostimato  of  the  vnrianco  of  tho  normal 


distribution,  the  number  of  failures  nocossary  for  any  do- 
sirod  confidence  interval  may  be  determined  from 

»  “  (  2Z(,.,VZ)  °/  R  >2  (i : 

whore  CTis  tho  ostimato  of  varicuico,  ^(i-c</2)  ^10  nor“ 

mal  coordinate  for  tho  do3ircd  confidence  and  R  is 

tho  difference  botwocn  tho  u}>pcr  and  lower  limits  of  the 
confidence  interval.  Tho  confidonco  interval  commonly  used 
is  95L  confidonco  interval,  whore  Z  =  1*96. 

If  one  ha3  an  ostimato  of  MTBp1  arid  one  determines  tho 


duration  of  a  sampling  plan  equal  to  this  ostimato,  the  ex¬ 
pected  number*  of  failures  would  equal  one  half  of  the  num- 

1 

ber  of  items  on  tost.  In  practice  this  would  not  provide 
a  very  effective  test  a inco  all  the  timos  to  failure  which 
would  have  oxceded  tho  moan  aro  not  observed,  thoroforo 
complicating  tho  analysis  of  data  and  making  it  difficult 
to  deto amino  if  the  underlying  distribution  was  actually 
normal.  The  most  desirable  condition  occurs  when  all  the 
items  on  tost  have  failed.  Tho  probability  an  item  will 
fail  in  a  time  T  is  %  if  T  =  MTBF  +  Z^CT  .  Tho  probability 


1  Ono  must  assume  hero  that  all  items  on  test  start 
at  time  zero  in  now  or  repaired  condition. 
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that  all  items  in  a  sample  of  size  n  will  fail  in  T  is  yn. 

It  i3  obvious  that  as  n  becomes  large,  one  should  not  oxpect 
all  items  on  tost  to  fall.  However*,  with  a  large  numbor  of 
data  points  covering  a  sufficient  rango  of  the  distribution, 
ono  nay  perform  a  least  squares  fit  to  determine  the  param¬ 
eters  of  the  distribution.  This  test  requires  that  the  data 
be  grouped.  Obviously,  the  last  group  would  consist  of  those 
items  that  have  not  yet  failed.  The  expectod  sizo  of  this 
group  is  (I-)1)  n,  is  the  sample  size. 

For  small  samples,  n  30,  or  samples  with  little  confi¬ 
dence  in  the  original  estimate  of  the  variance,  ono  would 
have  to  use  a  "t"  test  to  determine  the  confidence  interval. 

It  must  be  noted  that  only  ono  failure  per  item  is 
recorded.  The  sample  size  is  then  calculated  to  be  n  =  N/y . 

If  a  sample  of  size  n  is  not  available  or  is  inconvon- 
ient  to  test,  one  may  specify  that  each  item  on  test  fail  a 
specific  number  of  times  p.  This  requirement  must  bo  imposed 
since,  if  somo  items  fail  many  times  and  others  fail  fewer 
times,  one  will  bias  the  sample  moan  to  be  shot  ter  than  the 
actual  moan  of  the  undorlying  distribution.  The  sample 
size  is  now  n/p,  but  the  duration  of  the  test  must  now  bo 
increased.  Tho  moan  time  to  the  p  failure  is  p  times  the 
KT3F  and  tho  varianco  will  bo  p  times  the  estimate  of  time 
between  failure  variance.  One  may  use  this  moan  variance 
and  go  through  the  same  procedure  as  for  one  failure  per 
item  to  determine  tho  duration  of  the  tost. 
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Tf  ono  has  an  ootimsto  of  MTDP  but  no  estimate  ol‘  the 
Variance,  ono  ha3  no  good  niothod  by  which  to  do t ermine  a 
sample  size,  A  boat  gucaa  of  variance  may  bo  used  and  n 
plan  jaay  be  do signed  to  these  specifications.  Tho  confi¬ 
dence  interval  on  tho  mean  will  bo  determined  by  a  "t"  to3t. 
Ono  nay  havo  to  tako  a  second  sample  if  tho  desired  accuracy 
13  not  obtained.  One  could  also  have  tested  a  small  sample 
to  get  an  estimato  of  the  variance  and  then  designed  tho 
tost  using  this  estimate* 

A  procedure  similar  to  tho  ono  above  may  bo  usod  when 
one  has  neither  ostimato-  In  this  ense.  the  testing  of  a 
small  sample  is  probably  more  advantageous  than  guo3sing  a 
moan  variance.  In  the  latter  case,  the  probability  of 
having  to  tako  a  second  large  sample  is  greatly  incroasod. 

The  cost  trado-offs  to  bo  made  in  tho  above  plans  do 
not  lend  themselves  to  analytical  methods  of  solution.  IX' 
tho  cost  of  sampling  Gp (n )  and  the  timo  costs  C1 (T)  are 
known,  a  brute  force  mothod  of  minimizing  cost  may  be  usod. 

A  computer  program  could  be  designed  to  do  this. 

The  trade-off  between  accxiracy  and  total  cost  may  be 
performed  moro  easily.  It  is  a  reasonable  assumption,  in 
this  case,  that  the  total  cost  i3  a  function  of  Tnis 

is  true  3inco  the  sampling  duration  T  will  not  vary  much 
with  N  once  an  optimal  n  and  T  havo  been  determined.  The 
difficulty  here  lios  in  determining  the  worth  of  accuracy 
W(R)  and  tho  cost  relationship  C(N), 
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Tho  quantity  W(R)-C(N)  represents  tho  value  of  the 
information  after  the  coat  of  obtaining  this  information 
haa  been  ooneldered.  To  optimize  the  sampling  plan,  one  must 
maximize  thia  quantity.  The  relationship  betweon  R  and  N  is 
given  by  Equation  (1 )  and  the  solution  may  be  obtained  by 
simple  substitution  of  a  variable  and  solving  for  a  maximum. 

Finally,  one  must  consider  tho  e.go  of  tho  items  being 
sampled.  If  tho  parameter  to  be  measured  is  MTBF,  tho  sam¬ 
ple  should  include  all  ages  with  equal  probability.  If 
this  condition  is  not  met,  the  moan  measurod  i3  biased  by 
the  age  of  the  equipment.  It  is  thoroforo  important  to 
take  more  than  one  sample  when  one  is  dealing  uith  newly 
fielded  items.  Subsequent  samplos  can  be  scheduled  annually 
or  as  needed.  Tills  process  reveals  more  information  about 
the  instantaneous  failure  rate  of  the  systom. 

Tho  next  chapter  will  present  methods  to  dotormin© 


sampling 


plans 
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of  time  to  repair  is  log  normal. 


CHAPTER  VI 


LOG  NORMAL  TIT  IE  TO  REPAIR 

Tho  most  commonly  used  distribution  .for  timo  to  ropair 
is  the  log  normal  distribution.  Tho  KTTR  will  approach  a 
normal  distribution  with  n  variance  equal  to  tho  variance 
of  tho  tiino  to  ropair  distribution  divided  by  N  tho  sample 
size.  N  con  thoreforo  bo  calculated  by  the  mothod  of  chap¬ 
ter  V  if  tho  variance  is  known.  In  this  case,  however,  one 
need  not  he  concernod  that  tho  data  is  incoinplcto  3inco 

4.V.  ~  4- 1  ~  _  — . _  ^ ^  £  -i . — ^ 
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If  foiluroo  arc  to  bo  induced,  tho  tc3t  duration  will 
depend  only  on  tho  amount  of  time  necessary  to  induce  tho 
failure,  the  moan  and  maximum  time  to  repair,  and  the  num¬ 
ber  of  ropairs  that  can  bo  handled  at  one  timo.  One  can 
assume  tho  timo  to  induce  u  failui’e  is  negligible.  If  tho 
number  of  repaira  that  can  bo  handlod  at  one  time  i3  1©S3 
than  N,  ono  i3  facod  with  a  Quoueing  Theory  problem,  other¬ 
wise  ono  can  assume  tho  duration  of  a  tost  equal  to  the 
observed  maximum  timo  to  ropair.  The  observed  maximum  time 
to  repair  is  a  random  variable. 

If  one  wishes  to  terminate  the  tc3t  after  a  pronssignod 

duration,  one  would  like  to  have  all  repairs  accomplished 

j  jj 

by  this  timo.  Tho  confidence  of  this  happening  is  «  , 

whero  <f  is  the  probability  of  repair  in  tho  time  allowed. 
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In  most  cases  a  very  high  confidence  of  completing  all 
repairs  may  not  require  an  exceptionally  long  poriod  of 
time. 

In  most  cases,  naturally  occurring  failures  are  pre¬ 
ferable  to  induced  failures.  Induced  failures  would  usually 
be  usod  when  ono  wishes  to  study  a  particular  repair  opera¬ 
tion.  When  using  naturally  occurring  failures,  ono  must  be 
caroful  if  dealing  with  a  particular  ago  group.  It  is  vory 
possible  that  a  large  percentage  of  the  repairs  are  charac¬ 
teristic  of  that  age  group  and  therefore  bias  tho  measure 
of  repair  time.  It  is  important  to  get  as  uniformly  dis¬ 
tributed  on  ago  group  as  possible  when  measuring  MTTR,  In 
oases  where  this  ia  impossible,  such  as  with  a  nowly  fielded  ■ 

itom,  one  or  more  subsequent  samples  should  bo  taken  to  de¬ 
termine  tho  effect  of  age. 

The  duration  of  the  test  will  now  dopond  on  the  instan¬ 
taneous  failure  rate  and  sample  size.  The  instantaneous 
failure  rate  is  the  inverse  of  tho  moan  time  between  fail-  j 

A  ] 

ure.  The  sample  size  for  this  problem  may  be  several  times  \ 

] 

larger  than  tho  number  of  failures  we  wish  to  observe.  Even  j 

If  tho  distribution  of  time  to  failure  is  not  exponential,  ] 

j 

the  expected  number  of  failures  with  n  items  on  test  for  a  ] 

time  TQ  In  N  =  nT0/8.  The  instantaneous  failure  rate  may  be  j 


1  A  thorough  blend  of  ages  and  states  of  repair  mast 
be  assumed. 
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U30U.  a  a  r,n  approximation  oven  if  the  fniluro  ruto  in  not 
constant  an  lone  an  the  failure  rate  dooo  not  chungo  signif¬ 
icantly  during  tho  toot  Interval. 

The  toot  duration  would  thon  equal  TQ  +  whore 

^Dnax  ^-l0  WnXilnuiu  down  tiro  .  A  coat  trado-off  could  bo 
pe informed  in  the  nano  manner  an  in  Chapter  IV,  The  above 
method  require a  n/h  bo  large.  If  ono  cannot  oatiafy  thin 
condition,  tho  distribution  of  timo  botwoon  failures  must 
bo  conniderod,  and  a  sampling  duration  must  be  determined 
in  a  manner  similiar  to  that  in  Chapter  V,  Thi3  duration 
in,  of  courno,  modified  by  adding  Tj^nax> 

If  tho  noedod  estimates  are  not  available,  ono  muat 
guentimate  tho  parameters  and  dooign  a  plan  accordingly, 
knowing  that  a  further  sample  may  bo  noco3sary. 


CHAPTER  VII 


SUMMARY  AND  CONCLUSIONS 

Chaptora  III  thru  VI  havo  been  concerned  with  dotor- 
mining  sample  sizes  and  to-t  durations  for  any  typo  itom, 
system,  and  subsystem  in  goncral.  The  specific  itom  con¬ 
siderations  of  a  sampling  plan  such  a3  data  collection 
forms,  goographic  locations  of  the  samples,  and  training 
skill  of  the  data  collectors  have  not  been  omphisizod  in 
this  paper.  The  cost  considerations  mentioned  in  tho  text 
would  obviously  fall  in  this  category.  Thoso  considerations 
are  very  important  to  a  sampling  plan  and  further  work  should 
bo  continued  in  this  area. 

The  work  of  this  paper  may  be  furthor  extended  to  in¬ 
clude  plans  measuring  many  other  parameters  and  combinations 
thereof  and  may  includo  moro  goneral  considerations  of  dis¬ 
tribution.  Hopefully,  in  the  future,  tho  Army  would  be  able 

4*  A  A  AwVs  n  A  .»1  1  I  A  £  I'  /\  M  4-  4  MM  £  M  Jm  M  A  U  J  t.  .h  M  <■  <.  1 
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regulations  which  will  guide  the  equipment  proponent  in  the 
formulation  of  a  complete  data  collection  plan.  Until  then, 
individual  work  should  continue  in  accumilating  this  inform¬ 
ation. 
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